Stern-Gerlach Experiment
The Stern-Gerlach experiment is a fundamental experiment in quantum mechanics that demonstrates the quantization of angular momentum. It's results are a direct contradiction to classical mechanics. This page is highly inspired by the section on the Stern-Gerlach experiment in Sakurai's textbook, as well as the Eigenchris video on Jones vectors.
Table of Contents
The Setup and its Results
Imagine a beam of silver atoms traveling through a magnetic field. In classical mechanics, we would expect the magnetic field to exert a force on the atoms, causing them to deflect.
Stern-Gerlach Experiment
The Stern-Gerlach experiment was designed to test this prediction. In the setup, a furnace heats up silver atoms (1 in the image). They escape through a small hole and travel through a collimator (2), which ensures that the beam is narrow and well-defined. Then, the beam passes through an inhomogeneous magnetic field, which causes the atoms to deflect. (An inhomogeneous magnetic field is a magnetic field that varies in strength and direction over space.)
The electronic configuration of silver atoms is
Below, I will provide two explanations for the classical prediction for the Stern-Gerlach experiment: one using some mathematical machinery and the other using an intuitive analogy.
- Mathematical Explanation
- Intuitive Explanation
Let the electron spin angular momentum be
The energy of the interaction between the magnetic moment and the magnetic field is given by:
Therefore, the force on the atom is given by the gradient of the energy:
If we assume that the magnetic field is in the
According to classical mechanics, since the atom can have any orientation of the magnetic moment, the
Magnetic forces appear when charge is moving. For example, when a current flows through a wire, a magnetic field is generated around the wire. This is how electromagnets work; you coil a wire around a piece of iron, and when you pass a current through the wire, the iron becomes magnetic.
In the Stern-Gerlach experiment, the south-pole of a magnet is placed at the top, above the north-pole of another magnet. The key point is that the magnetic field is inhomogeneous, meaning it varies in strength and direction over space. Specifically, the south-pole part is much stronger than the north-pole part. This means that the magnetic field is stronger at the top than at the bottom.
Suppose you throw a bar magnet into a SG apparatus setup. If it has its north pole pointing up, there would be a strong attraction towards the south pole at the top, and a weaker attraction towards the north pole at the bottom. Hence, the bar magnet would be pulled up.
Conversely, if the bar magnet has its south pole pointing up, there would be a strong repulsion from the south pole at the top, and a weaker repulsion from the north pole at the bottom. Hence, the bar magnet would be pushed down.
If the bar magnet is tilted slightly, it would experience a force that is a combination of the two. Overall, its path would be curved, and it would hit the screen at a point that corresponds to the combination of the two forces. Since you can continuously tilt the bar magnet, you would expect to see a continuous distribution of spots on the screen.
During the times of the SG experiment, the Bohr model of the atom was prevalent.
In it, the electron orbits the nucleus in a circular path.
Silver atoms have one unpaired electron in the 5th shell - this means that there is charge moving in a circular path.
And as I said, a moving charge experiences magnetic forces. In fact, we can model the entire atom as a tiny bar magnet.
Therefore, we would expect the beam to spread out in the
Instead, what happens is that the beam splits into two distinct beams, and only two spots are observed on the screen (5 in the image).
This means that in reality, the magnetic moment can only have two values:
This is a direct contradiction to classical mechanics, which predicts that the magnetic moment can have any value between
Sequential Stern-Gerlach Experiments
Now, let's consider what happens when we pass the beam through another Stern-Gerlach apparatus. The goal is to understand how passing the beam through multiple Stern-Gerlach apparatuses affects the angular momentum of the atoms.
To remind ourselves, in the classical picture, the angular momentum is defined as
Case 1: z-z
In the first case, we pass the beam through a SG apparatus with the magnetic field in the
Since no external force changes the angular momentum, all the atoms in the up-spin beam will remain up-spin. Thus, only one beam will be observed on the screen.
The experimental result aligns with this prediction.
Sakurai uses a visual diagram to represent the state of the beam, which I replicated below.
Case 2: z-x
In the second case, we pass the beam through a
It turns out that after passing through the
What does this mean?
One guess is that in the original beam, half of the atoms have both
The visual representation of this is shown below.
Case 3: z-x-z
Perhaps the most interesting case is when we do what we did in the second case and then pass the beams through another
- Pass through
-SG apparatus, splitting into and . - Block off
. - Pass through
-SG apparatus, splitting into and . - Block off
. - Pass through
-SG apparatus. What happens?
It turns out that the beam splits into two again, one being
The visual representation of this is shown below.
What this tells us is the following: when the
Polarization of Light
The Stern-Gerlach experiment has a direct analogy in the polarization of light. Light is an electromagnetic wave, and the electric field of the wave can oscillate in any direction perpendicular to the direction of propagation. The direction of oscillation is called the polarization of the light.
Note that we are using light purely as an analogy to the Stern-Gerlach experiment, in hopes of creating a mathematical framework that can describe both phenomena. In this analogy, we treat light purely classically as an electromagnetic wave. There is a lot to say about the quantum nature of polarization - watch this video for more. In fact, watch it even if you don't care about the quantum nature of polarization.
If a light beam is
It is often convenient to use complex notation to represent the electric field. Of course, the physical electric field is real, but the complex notation simplifies calculations.
We can also add a phase factor to the electric field:
A
More generally, an electric field can be written as a linear combination of
We can separate the exponential and the polarization vector:
The vector
In order to get
In a similar manner to the Stern-Gerlach experiment, we can pass light through a series of filters and observe the polarization of the light:
-
Case 1: Pass through
-filter, then -filter.In this case, the light will be
-polarized after the first filter. Then, no light will pass through the -filter, as it only allows -polarized light to pass through. Thus, the light will be completely blocked. -
Case 2: Pass through
-filter, then a -filter, then -filter.In this case, there will actually be light passing through the
-filter.
Let's examine the second case in more detail.
Denote the direction of the
SG Experiment | Light Experiment |
---|---|
After SG apparatus: Lose spin information | After filter: Lose polarization information |
The basis vectors
The key is that once light passes through an
More precisely speaking, we can write down the coordinate transformation:
Another way of thinking about it is that mathematically, a filter simply projects the Jones vector onto the basis vector of the filter.
Since we can represent the polarization of light as a vector as we have seen, perhaps we can use the same vector representation for the spin of particles. Of course, this means that the particles have a property that can be represented as a vector in a two-dimensional space.
We can write the spin of a particle as a vector in a two-dimensional space.
In the
Next, how do we represent spin in the
By symmetry, if we pass
Circular Polarization
Another type of polarization is circular polarization, where the electric field rotates in a circle.
To understand this better, let's get a deeper understanding of the Jones vector.
Recall that the Jones vector can be represented by combining the
Circular polarization occurs when there is a phase difference between the
With imaginary components, it is a bit harder to visualize the Jones vector. If we get the full expression for the electric field, we will see the following:
At the point
Using Euler's formula, we can expand it out and then take the real part:
In other words, the electric field rotates in a circle clockwise.
Go to the above visualizer and set the following values:
This wave is left-circularly polarized. If one points their left thumb in the direction of propagation, the electric field will rotate in the direction of the fingers.
Overall, the Jones vector for different types of polarization is summarized in the following table:
Polarization | Jones Vector |
---|---|
Linear in | |
Linear in | |
Linear at | |
Linear at | |
Linear at | |
Left-circular | |
Right-circular |
Circular-polarized light can be created by passing linear-polarized light through a quarter-wave plate, which introduces a phase difference between the
SG Experiment | Light Experiment |
---|---|
After SG apparatus: Lose spin information | After filter: Lose polarization information |
Circular-polarized light |
We can hence write down the
SU(2) States for Polarization
In the physical world, we transform the polarization of light using wave plates and polarizers. Mathematically, each of these devices corresponds to a transformation on the Jones vector. These transformations can be represented by matrices called Jones matrices.
Wave plates are usually birefringent crystals - crystals that have different refractive indices for different polarizations.
This means that light slows down differently depending on the direction, and hence have a different phase shift for different polarizations.
This causes the phase difference between the
A quater wave plate - one that introduces a quarter-cycle phase shift in the
This changes a diagonal-polarized wave
The interesting part of this occurs when we keep applying these transformations:
In other words, the quarter wave plate matrix allows us to rotate between the different types of polarization in the order
Shape of polarized light
Abstract polarization space
Now imagine that we have the same birefringent crystal, but this time we rotate it by
Mathematically, this is equivalent to applying the following transformations: (1) rotate the light by
An important thing to note is that the
In other words, the rotated quarter wave plate matrix allows us to rotate between the different types of polarization in the order
Now we have two different matrices that allow us to rotate between the different types of polarization. If we put their circles together, we form a sphere as follows:
The transformations are represented by the matrices:
Blue transformation | Yellow transformation | Green transformation |
---|---|---|
? |
As you can see, by placing the different polarization states on a sphere, we can identify another transformation that we're missing. This sphere is called the Poincaré sphere. The third transformation is the following:
It turns out that these three matrices all belong to the Unitary Group of
If a matrix has only real entries, then its adjoint is the same as its transpose.
Given that a, say,
These matrices are called orthogonal matrices, and the group of all
We can see that the
Hence, the determinant can be any complex number with a magnitude of
We also know that because the global phase shift does not matter, there are multiple matrices that can represent the same transformation.
For example,
If the determinant of a matrix is
Summary of the Groups
Below is a diagram that summarizes the different groups of matrices -
Preserves the magnitude of the | |
---|---|
Special Unitary Group of | |
Preserves the magnitude of real vectors; | |
Special Orthogonal Group of | |
Jones Vectors are Spinors
The Jones vectors we have been using to represent the polarization of light are actually mathematical objects called spinors.
To see why, consider the horizontal and vertical polarization vectors
In physical space, these are
This is the characteristic of a spinor - it requires a
Summary and Next Steps
This was quite a tough "introduction" to quantum mechanics. We started with the Stern-Gerlach experiment, which led us to the concept of spin states. Then, we postulated that the spin states are similar to the polarization states of light.
Here are the key points to remember:
- The Stern-Gerlach experiment is a thought experiment that separates particles based on their spin.
- From the SG experiment, it is clear that angular momentum is quantized. That is, particles can only have certain values of angular momentum.
- Performing sequential SG experiments in different directions leads to the conclusion that spin states can only exist in one direction. That is, when we measure or set the spin of a particle in one direction, the spin in the other directions is lost.
- The spin states are similar to the polarization states of light. The
states are similar to the and polarizations, while the states are similar to the and polarizations. - Mathematically speaking, the two spin states are members of a two-dimensional complex vector space and act as a basis for the space.
- The
states are similar to circularly polarized light. - The polarization of light can be represented by the Jones vector, which can be transformed by matrices called Jones matrices. They can be represented in an abstract space known as the Poincaré sphere.
- Jones matrices are elements of the
group, the unitary group of matrices. - The
group is similar to the group, the special orthogonal group of matrices. - Jones vectors rotate twice in abstract polarization space as they do in physical space, making them spinors.
While it may seem unnecessary to go through all of the matrix math and group theory, it gives us a huge headstart in understanding quantum mechanics. It will become very important when we study angular momentum in detail.
In the next chapter, we will apply the different matrix transformations back to the Stern-Gerlach experiment to see how they affect the spin states of particles.
References
- J.J. Sakurai, "Modern Quantum Mechanics", section 1.1 ("The Stern-Gerlach Experiment").
- Eigenchris "Spinors for Beginners", videos 2 and 3.
Appendix: Magnetism Review
Magnetic Moment
The magnetic moment of a particle is a measure of the strength and orientation of its magnetic field.
It is a vector quantity, denoted by
Mathematically, the magnetic moment is defined as:
where
If one needs more detail, virtually any introductory electromagnetism textbook will have a section on magnetic moments.